Basic Level Categorization Facilitates Visual Object Recognition

نویسندگان

  • Panqu Wang
  • Garrison W. Cottrell
چکیده

Recent advances in deep learning have led to significant progress in the computer vision field, especially for visual object recognition tasks. The features useful for object classification are learned by feed-forward deep convolutional neural networks (CNNs) automatically, and they are shown to be able to predict and decode neural representations in the ventral visual pathway of humans and monkeys. However, despite the huge amount of work on optimizing CNNs, there has not been much research focused on linking CNNs with guiding principles from the human visual cortex. In this work, we propose a network optimization strategy inspired by both of the developmental trajectory of children’s visual object recognition capabilities, and Bar (2003), who hypothesized that basic level information is carried in the fast magnocellular pathway through the prefrontal cortex (PFC) and then projected back to inferior temporal cortex (IT), where subordinate level categorization is achieved. We instantiate this idea by training a deep CNN to perform basic level object categorization first, and then train it on subordinate level categorization. We apply this idea to training AlexNet (Krizhevsky et al., 2012) on the ILSVRC 2012 dataset and show that the top-5 accuracy increases from 80.13% to 82.14%, demonstrating the effectiveness of the method. We also show that subsequent transfer learning on smaller datasets gives superior results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating Randomization and Discrimination for Classifying Human-Object Interaction Activities

Psychologists have shown that the ability of humans to perform basic-level categorization (e.g. cars vs. dogs; kitchen vs. highway) develops well before their ability to perform subordinate-level categorization, or fine-grained visual categorization (e.g. distinguishing dog breeds such as Golden retrievers vs. Labradors) [18]. It is interesting to observe that computer vision research has follo...

متن کامل

Subordinate-level Object Recognition 1 Running Head: Face and Object Recognition Does Visual Subordinate-level Categorization Engage the Functionally-deened Fusiform Face Area? Subordinate-level Object Recognition 2

Functional magnetic resonance imaging was used to compare brain activation associated with basic-level (e.g., BIRD) and subordinate-level (e.g., EAGLE) processing for both visual and semantic judgments. We localized the putative face area for eleven subjects, who also performed visual matching judgments for pictures and aurally-presented words. The middle fusiform and occipital gyri were recrui...

متن کامل

Basic Level Categorizatiandon is Important: Modelling the Facilitation in Visual Object Recognition

Recent advances in deep learning have led to significant progress in the computer vision field, especially for visual object recognition tasks. The features useful for object classification are learned by feed-forward deep convolutional neural networks (CNNs) automatically, and they are shown to be able to predict and decode neural representations in the ventral visual pathway of humans and mon...

متن کامل

4 Degrees of Expertise

Visual object recognition and categorization are fundamental abilities required for successful negotiation of the visual world. Humans effortlessly classify and recognize objects and faces within busy scenes, thousands of times a day. Thus, understanding how perceptual categorization and learning occur and how such seemingly complicated computations are implemented in brain processes is an impo...

متن کامل

Levels of categorization in visual recognition studied with functional MRI

Background: Recent functional neuroimaging results implicate part of the ventral temporal lobe of the brain in face recognition and have, together with neuropsychological findings, been used as evidence for a face-specific neural module in the brain. Experimental designs, however, have often failed to distinguish between the class of the object used as the stimulus (face or non-face) and the le...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1511.04103  شماره 

صفحات  -

تاریخ انتشار 2015